Amendments to the Claims 

1 . (Previously Presented) A method, comprising: 
receiving input text at a client device; 

analyzing the input text to determine diphones; 

sending a request to a server for diphone waveform data based on the determined 
diphones; 

locating the requested diphone waveform data by searching a concatenative 

diphone waveform database at the server; 
generating a set of compressed diphone residuals and Linear Predictive Coding 

(LPC) coefficients by compressing results of the searched diphone 

waveform database; 
storing the set of compressed diphone residuals and the LPC coefficients in a 

compressed packet; 
transmitting the compressed packet to the client device; and 
upon receiving the compressed packet, the client device decompresses the 

compressed packet back to diphone waveform data available for use in a 

text-to-speech synthesizer. 

2. (Previously Presented) The method of claim 1 , wherein the generating of 
the set of compressed diphone residuals is performed using an encoder. 

3. (Previously Presented) The method of claim 1 , further comprising 
receiving the request from the text-to-speech synthesizer, the text-to-speech 
synthesizer residing at the client device. 

4. (Previously Presented) The method of claim 1, further comprising 
providing pitch marks to the text-to-speech synthesizer. 
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5. (Previously Presented) The method of claim 2, wherein the encoder 
comprises a G.723 encoder. 

6. (Currently Amended) A system comprising: 
a sever; 

a client device coupled the sever, the client device to 
receive input text, 

analyze the input text to determine diphones, and 

send a request to he- the server for diphone waveform data based on the 

determined diphones; 
the server to 

locate diphone waveform data by searching a concatenative diphone waveform 
database, 

generate a set of compressed diphone residuals and Linear Predictive Coding 
(LPC) coefficients by compressed diphone residuals and the LPC 
coeffients in a compressed packet, and 

transmit the compressed packet to the client device; and 

the client device to decompress the compressed packet back to diphone waveform 
data available for use in a text-to-speech synthesizer. 

7. (Previously Presented) The system of claim 6, wherein the server is further 
to generate the set of compressed diphone residuals using an encoder, the encoder 
including a G.723 encoder. 

8. (Previously Presented) The system of claim 6, wherein the server is further 
to provide pitch marks to the text-to-speech synthesizer at the client device. 
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9. (Currently Amended) The system of claim 8, wherein the text-to-speech 
synthesizer at eh-the client is further to receive the pitch marks. 

1 0. (Previously Presented) The system of claim 6, wherein the client device 
comprises a handheld device including one or more of the following: a telephone, 
a pocket computer system, and a personal digital assistant (PDA). 

1 1 . (Previously Presented) A machine-readable medium having stored thereon 
data comprising sets of instructions which, when executed by a machine, cause 
the machine to: 

receive input text at a client device; 
analyze the input text to determine diphones; 

send a request to a server for diphone waveform data based on the determined 
diphones; 

locate the requested diphone waveform data by searching a concatenative diphone 

waveform database at the server; 
generate a set of compressed diphone residuals and Linear Predictive Coding 

(LPC) coefficients by compressing results of the searched diphone 

waveform database; 
store the set of compressed diphone residuals and LPC coefficients in a 

compressed packet; 
transmit the compressed packet to the client device; and 
upon receiving the compressed packet, the client device decompresses the 

compressed packet back to diphone waveform data available for use in a 

text-to-speech synthesizer. 
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12. (Previously Presented) The machine-readable medium of claim 1 1 , wherein 
the generating of the set of compressed diphone residuals is performed using an 
encoder. 

13. (Previously Presented) The method of claim 11, wherein the sets of 
instructions which, when executed by the machine, further cause the machine to 
receive the request from the text-to-speech synthesizer, the text-to-speech 
synthesizer residing at the client device. 

14. (Previously Presented) The machine-readable medium of claim 11, wherein 
the sets of instructions which, when executed by the machine, further cause the 
machine to provide pitch marks to the text-to-speech synthesizer. 
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